Dataset statistics
| Number of variables | 40 |
|---|---|
| Number of observations | 1296675 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 243.6 MiB |
| Average record size in memory | 197.0 B |
Variable types
| Numeric | 17 |
|---|---|
| Text | 6 |
| Categorical | 4 |
| Boolean | 13 |
amt is highly overall correlated with amt_day_interaction and 2 other fields | High correlation |
amt_day_interaction is highly overall correlated with amt and 2 other fields | High correlation |
amt_merchant_interaction is highly overall correlated with amt and 2 other fields | High correlation |
amt_ratio is highly overall correlated with amt and 2 other fields | High correlation |
day is highly overall correlated with day_of_month | High correlation |
day_of_month is highly overall correlated with day | High correlation |
day_of_week is highly overall correlated with amt_day_interaction and 1 other fields | High correlation |
is_weekend is highly overall correlated with day_of_week | High correlation |
merchant_encoded is highly overall correlated with amt_merchant_interaction | High correlation |
is_fraud is highly imbalanced (94.9%) | Imbalance |
high_value is highly imbalanced (71.4%) | Imbalance |
category_food_dining is highly imbalanced (63.2%) | Imbalance |
category_gas_transport is highly imbalanced (52.6%) | Imbalance |
category_grocery_net is highly imbalanced (78.1%) | Imbalance |
category_grocery_pos is highly imbalanced (54.6%) | Imbalance |
category_health_fitness is highly imbalanced (64.8%) | Imbalance |
category_home is highly imbalanced (54.7%) | Imbalance |
category_kids_pets is highly imbalanced (57.3%) | Imbalance |
category_misc_net is highly imbalanced (71.9%) | Imbalance |
category_misc_pos is highly imbalanced (66.7%) | Imbalance |
category_personal_care is highly imbalanced (63.4%) | Imbalance |
category_shopping_net is highly imbalanced (61.5%) | Imbalance |
category_shopping_pos is highly imbalanced (56.4%) | Imbalance |
category_travel is highly imbalanced (79.9%) | Imbalance |
amt is highly skewed (γ1 = 42.27787379) | Skewed |
amt_ratio is highly skewed (γ1 = 34.80955403) | Skewed |
amt_merchant_interaction is highly skewed (γ1 = 50.34670773) | Skewed |
amt_day_interaction is highly skewed (γ1 = 62.28714456) | Skewed |
day_of_week has 254282 (19.6%) zeros | Zeros |
hour has 42502 (3.3%) zeros | Zeros |
amt_day_interaction has 254282 (19.6%) zeros | Zeros |
Reproduction
| Analysis started | 2024-09-15 08:45:47.343690 |
|---|---|
| Analysis finished | 2024-09-15 08:49:10.288925 |
| Duration | 3 minutes and 22.95 seconds |
| Software version | ydata-profiling vv4.10.0 |
| Download configuration | config.json |
amt
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 52928 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.351035 |
| Minimum | 1 |
|---|---|
| Maximum | 28948.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.44 |
| Q1 | 9.65 |
| median | 47.52 |
| Q3 | 83.14 |
| 95-th percentile | 196.31 |
| Maximum | 28948.9 |
| Range | 28947.9 |
| Interquartile range (IQR) | 73.49 |
Descriptive statistics
| Standard deviation | 160.31604 |
|---|---|
| Coefficient of variation (CV) | 2.2788014 |
| Kurtosis | 4545.645 |
| Mean | 70.351035 |
| Median Absolute Deviation (MAD) | 37.5 |
| Skewness | 42.277874 |
| Sum | 91222429 |
| Variance | 25701.232 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.14 | 542 | < 0.1% |
| 1.04 | 538 | < 0.1% |
| 1.25 | 535 | < 0.1% |
| 1.02 | 533 | < 0.1% |
| 1.01 | 523 | < 0.1% |
| 1.05 | 519 | < 0.1% |
| 1.2 | 516 | < 0.1% |
| 1.23 | 515 | < 0.1% |
| 1.08 | 512 | < 0.1% |
| 1.11 | 509 | < 0.1% |
| Other values (52918) | 1291433 |
| Value | Count | Frequency (%) |
| 1 | 222 | |
| 1.01 | 523 | |
| 1.02 | 533 | |
| 1.03 | 499 | |
| 1.04 | 538 | |
| 1.05 | 519 | |
| 1.06 | 471 | |
| 1.07 | 498 | |
| 1.08 | 512 | |
| 1.09 | 496 |
| Value | Count | Frequency (%) |
| 28948.9 | 1 | |
| 27390.12 | 1 | |
| 27119.77 | 1 | |
| 26544.12 | 1 | |
| 25086.94 | 1 | |
| 17897.24 | 1 | |
| 15305.95 | 1 | |
| 15047.03 | 1 | |
| 15034.18 | 1 | |
| 14849.74 | 1 |
first
Text
| Distinct | 352 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 6.0804319 |
| Min length | 3 |
Characters and Unicode
| Total characters | 7884344 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Jennifer |
|---|---|
| 2nd row | Stephanie |
| 3rd row | Edward |
| 4th row | Jeremy |
| 5th row | Tyler |
| Value | Count | Frequency (%) |
| christopher | 26669 | 2.1% |
| robert | 21667 | 1.7% |
| jessica | 20581 | 1.6% |
| james | 20039 | 1.5% |
| michael | 20009 | 1.5% |
| david | 19965 | 1.5% |
| jennifer | 16940 | 1.3% |
| william | 16371 | 1.3% |
| mary | 16346 | 1.3% |
| john | 16325 | 1.3% |
| Other values (342) | 1101763 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1007700 | 12.8% |
| e | 860878 | 10.9% |
| i | 618247 | 7.8% |
| n | 614453 | 7.8% |
| r | 607072 | 7.7% |
| l | 388220 | 4.9% |
| h | 344993 | 4.4% |
| s | 324237 | 4.1% |
| t | 311569 | 4.0% |
| o | 268849 | 3.4% |
| Other values (39) | 2538126 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6587669 | |
| Uppercase Letter | 1296675 | 16.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1007700 | |
| e | 860878 | |
| i | 618247 | |
| n | 614453 | |
| r | 607072 | |
| l | 388220 | 5.9% |
| h | 344993 | 5.2% |
| s | 324237 | 4.9% |
| t | 311569 | 4.7% |
| o | 268849 | 4.1% |
| Other values (16) | 1241451 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 218907 | |
| M | 144916 | |
| S | 114469 | |
| A | 112464 | |
| C | 106121 | |
| D | 86078 | 6.6% |
| K | 85426 | 6.6% |
| R | 70457 | 5.4% |
| T | 66590 | 5.1% |
| L | 62879 | 4.8% |
| Other values (13) | 228368 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7884344 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1007700 | 12.8% |
| e | 860878 | 10.9% |
| i | 618247 | 7.8% |
| n | 614453 | 7.8% |
| r | 607072 | 7.7% |
| l | 388220 | 4.9% |
| h | 344993 | 4.4% |
| s | 324237 | 4.1% |
| t | 311569 | 4.0% |
| o | 268849 | 3.4% |
| Other values (39) | 2538126 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7884344 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1007700 | 12.8% |
| e | 860878 | 10.9% |
| i | 618247 | 7.8% |
| n | 614453 | 7.8% |
| r | 607072 | 7.7% |
| l | 388220 | 4.9% |
| h | 344993 | 4.4% |
| s | 324237 | 4.1% |
| t | 311569 | 4.0% |
| o | 268849 | 3.4% |
| Other values (39) | 2538126 |
last
Text
| Distinct | 481 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.1111774 |
| Min length | 2 |
Characters and Unicode
| Total characters | 7924211 |
|---|---|
| Distinct characters | 48 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Banks |
|---|---|
| 2nd row | Gill |
| 3rd row | Sanchez |
| 4th row | White |
| 5th row | Garcia |
| Value | Count | Frequency (%) |
| smith | 28794 | 2.2% |
| williams | 23605 | 1.8% |
| davis | 21910 | 1.7% |
| johnson | 20034 | 1.5% |
| rodriguez | 17394 | 1.3% |
| martinez | 14805 | 1.1% |
| jones | 13976 | 1.1% |
| lewis | 12753 | 1.0% |
| gonzalez | 11799 | 0.9% |
| miller | 11698 | 0.9% |
| Other values (471) | 1119907 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 786302 | 9.9% |
| r | 658748 | 8.3% |
| a | 648005 | 8.2% |
| n | 609178 | 7.7% |
| o | 583517 | 7.4% |
| l | 489180 | 6.2% |
| s | 487668 | 6.2% |
| i | 435378 | 5.5% |
| t | 288591 | 3.6% |
| h | 228981 | 2.9% |
| Other values (38) | 2708663 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6627536 | |
| Uppercase Letter | 1296675 | 16.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 786302 | |
| r | 658748 | |
| a | 648005 | |
| n | 609178 | |
| o | 583517 | |
| l | 489180 | 7.4% |
| s | 487668 | 7.4% |
| i | 435378 | 6.6% |
| t | 288591 | 4.4% |
| h | 228981 | 3.5% |
| Other values (15) | 1411988 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 158701 | |
| W | 106490 | 8.2% |
| S | 105221 | 8.1% |
| C | 93308 | 7.2% |
| B | 84092 | 6.5% |
| R | 83194 | 6.4% |
| H | 81444 | 6.3% |
| G | 75241 | 5.8% |
| J | 71781 | 5.5% |
| P | 66087 | 5.1% |
| Other values (13) | 371116 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7924211 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 786302 | 9.9% |
| r | 658748 | 8.3% |
| a | 648005 | 8.2% |
| n | 609178 | 7.7% |
| o | 583517 | 7.4% |
| l | 489180 | 6.2% |
| s | 487668 | 6.2% |
| i | 435378 | 5.5% |
| t | 288591 | 3.6% |
| h | 228981 | 2.9% |
| Other values (38) | 2708663 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7924211 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 786302 | 9.9% |
| r | 658748 | 8.3% |
| a | 648005 | 8.2% |
| n | 609178 | 7.7% |
| o | 583517 | 7.4% |
| l | 489180 | 6.2% |
| s | 487668 | 6.2% |
| i | 435378 | 5.5% |
| t | 288591 | 3.6% |
| h | 228981 | 2.9% |
| Other values (38) | 2708663 |
gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| F | |
|---|---|
| M |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1296675 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| F | 709863 | |
| M | 586812 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 709863 | |
| m | 586812 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 709863 | |
| M | 586812 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1296675 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 709863 | |
| M | 586812 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1296675 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 709863 | |
| M | 586812 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1296675 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 709863 | |
| M | 586812 |
street
Text
| Distinct | 983 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 29 |
| Mean length | 22.229027 |
| Min length | 12 |
Characters and Unicode
| Total characters | 28823823 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 561 Perry Cove |
|---|---|
| 2nd row | 43039 Riley Greens Suite 393 |
| 3rd row | 594 White Dale Suite 530 |
| 4th row | 9443 Cynthia Court Apt. 038 |
| 5th row | 408 Bradley Rest |
| Value | Count | Frequency (%) |
| apt | 327791 | 6.4% |
| suite | 305467 | 5.9% |
| island | 22954 | 0.4% |
| michael | 18967 | 0.4% |
| common | 17978 | 0.3% |
| station | 17957 | 0.3% |
| islands | 17917 | 0.3% |
| david | 17476 | 0.3% |
| brooks | 16991 | 0.3% |
| fields | 16321 | 0.3% |
| Other values (1940) | 4376722 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3859866 | 13.4% | |
| e | 1792676 | 6.2% |
| a | 1454190 | 5.0% |
| i | 1296969 | 4.5% |
| t | 1248091 | 4.3% |
| r | 1103208 | 3.8% |
| n | 1066149 | 3.7% |
| s | 1034564 | 3.6% |
| l | 889594 | 3.1% |
| o | 875571 | 3.0% |
| Other values (52) | 14202945 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14413030 | |
| Decimal Number | 6996528 | |
| Space Separator | 3859866 | 13.4% |
| Uppercase Letter | 3226608 | 11.2% |
| Other Punctuation | 327791 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1792676 | |
| a | 1454190 | |
| i | 1296969 | |
| t | 1248091 | |
| r | 1103208 | 7.7% |
| n | 1066149 | 7.4% |
| s | 1034564 | 7.2% |
| l | 889594 | 6.2% |
| o | 875571 | 6.1% |
| u | 613916 | 4.3% |
| Other values (16) | 3038102 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 561924 | |
| A | 421707 | |
| M | 258180 | 8.0% |
| C | 223839 | 6.9% |
| P | 195864 | 6.1% |
| R | 186303 | 5.8% |
| B | 148676 | 4.6% |
| F | 143149 | 4.4% |
| L | 131665 | 4.1% |
| J | 121164 | 3.8% |
| Other values (14) | 834137 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 748812 | |
| 3 | 739928 | |
| 2 | 734719 | |
| 7 | 703124 | |
| 1 | 693880 | |
| 8 | 692585 | |
| 6 | 677709 | |
| 0 | 677245 | |
| 4 | 669799 | |
| 9 | 658727 |
Space Separator
| Value | Count | Frequency (%) |
| 3859866 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 327791 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17639638 | |
| Common | 11184185 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1792676 | 10.2% |
| a | 1454190 | 8.2% |
| i | 1296969 | 7.4% |
| t | 1248091 | 7.1% |
| r | 1103208 | 6.3% |
| n | 1066149 | 6.0% |
| s | 1034564 | 5.9% |
| l | 889594 | 5.0% |
| o | 875571 | 5.0% |
| u | 613916 | 3.5% |
| Other values (40) | 6264710 |
Common
| Value | Count | Frequency (%) |
| 3859866 | ||
| 5 | 748812 | 6.7% |
| 3 | 739928 | 6.6% |
| 2 | 734719 | 6.6% |
| 7 | 703124 | 6.3% |
| 1 | 693880 | 6.2% |
| 8 | 692585 | 6.2% |
| 6 | 677709 | 6.1% |
| 0 | 677245 | 6.1% |
| 4 | 669799 | 6.0% |
| Other values (2) | 986518 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28823823 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3859866 | 13.4% | |
| e | 1792676 | 6.2% |
| a | 1454190 | 5.0% |
| i | 1296969 | 4.5% |
| t | 1248091 | 4.3% |
| r | 1103208 | 3.8% |
| n | 1066149 | 3.7% |
| s | 1034564 | 3.6% |
| l | 889594 | 3.1% |
| o | 875571 | 3.0% |
| Other values (52) | 14202945 |
city
Text
| Distinct | 894 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 8.6522459 |
| Min length | 3 |
Characters and Unicode
| Total characters | 11219151 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Moravian Falls |
|---|---|
| 2nd row | Orient |
| 3rd row | Malad City |
| 4th row | Boulder |
| 5th row | Doe Hill |
| Value | Count | Frequency (%) |
| city | 21314 | 1.3% |
| west | 19473 | 1.2% |
| north | 14425 | 0.9% |
| saint | 14363 | 0.9% |
| falls | 12794 | 0.8% |
| new | 11842 | 0.7% |
| mount | 11375 | 0.7% |
| lake | 11249 | 0.7% |
| san | 10260 | 0.6% |
| springs | 8727 | 0.5% |
| Other values (918) | 1482445 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1090254 | 9.7% |
| a | 935089 | 8.3% |
| n | 821831 | 7.3% |
| o | 817806 | 7.3% |
| l | 781662 | 7.0% |
| r | 748921 | 6.7% |
| i | 704285 | 6.3% |
| t | 598490 | 5.3% |
| s | 446306 | 4.0% |
| 321592 | 2.9% | |
| Other values (42) | 3952915 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9277246 | |
| Uppercase Letter | 1619290 | 14.4% |
| Space Separator | 321592 | 2.9% |
| Dash Punctuation | 1023 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1090254 | |
| a | 935089 | |
| n | 821831 | |
| o | 817806 | |
| l | 781662 | 8.4% |
| r | 748921 | 8.1% |
| i | 704285 | 7.6% |
| t | 598490 | 6.5% |
| s | 446306 | 4.8% |
| d | 309005 | 3.3% |
| Other values (15) | 2023597 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 156587 | 9.7% |
| M | 147711 | 9.1% |
| S | 136036 | 8.4% |
| B | 133396 | 8.2% |
| H | 115641 | 7.1% |
| W | 95433 | 5.9% |
| P | 92084 | 5.7% |
| L | 86511 | 5.3% |
| R | 79150 | 4.9% |
| A | 74999 | 4.6% |
| Other values (15) | 501742 |
Space Separator
| Value | Count | Frequency (%) |
| 321592 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1023 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10896536 | |
| Common | 322615 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1090254 | 10.0% |
| a | 935089 | 8.6% |
| n | 821831 | 7.5% |
| o | 817806 | 7.5% |
| l | 781662 | 7.2% |
| r | 748921 | 6.9% |
| i | 704285 | 6.5% |
| t | 598490 | 5.5% |
| s | 446306 | 4.1% |
| d | 309005 | 2.8% |
| Other values (40) | 3642887 |
Common
| Value | Count | Frequency (%) |
| 321592 | ||
| - | 1023 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11219151 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1090254 | 9.7% |
| a | 935089 | 8.3% |
| n | 821831 | 7.3% |
| o | 817806 | 7.3% |
| l | 781662 | 7.0% |
| r | 748921 | 6.7% |
| i | 704285 | 6.3% |
| t | 598490 | 5.3% |
| s | 446306 | 4.0% |
| 321592 | 2.9% | |
| Other values (42) | 3952915 |
state
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2593350 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NC |
|---|---|
| 2nd row | WA |
| 3rd row | ID |
| 4th row | MT |
| 5th row | VA |
| Value | Count | Frequency (%) |
| tx | 94876 | 7.3% |
| ny | 83501 | 6.4% |
| pa | 79847 | 6.2% |
| ca | 56360 | 4.3% |
| oh | 46480 | 3.6% |
| mi | 46154 | 3.6% |
| il | 43252 | 3.3% |
| fl | 42671 | 3.3% |
| al | 40989 | 3.2% |
| mo | 38403 | 3.0% |
| Other values (41) | 724142 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 355776 | |
| N | 284464 | 11.0% |
| M | 220694 | 8.5% |
| I | 181993 | 7.0% |
| T | 154353 | 6.0% |
| L | 147877 | 5.7% |
| O | 144031 | 5.6% |
| C | 141011 | 5.4% |
| Y | 131298 | 5.1% |
| X | 94876 | 3.7% |
| Other values (14) | 736977 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2593350 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 355776 | |
| N | 284464 | 11.0% |
| M | 220694 | 8.5% |
| I | 181993 | 7.0% |
| T | 154353 | 6.0% |
| L | 147877 | 5.7% |
| O | 144031 | 5.6% |
| C | 141011 | 5.4% |
| Y | 131298 | 5.1% |
| X | 94876 | 3.7% |
| Other values (14) | 736977 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2593350 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 355776 | |
| N | 284464 | 11.0% |
| M | 220694 | 8.5% |
| I | 181993 | 7.0% |
| T | 154353 | 6.0% |
| L | 147877 | 5.7% |
| O | 144031 | 5.6% |
| C | 141011 | 5.4% |
| Y | 131298 | 5.1% |
| X | 94876 | 3.7% |
| Other values (14) | 736977 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2593350 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 355776 | |
| N | 284464 | 11.0% |
| M | 220694 | 8.5% |
| I | 181993 | 7.0% |
| T | 154353 | 6.0% |
| L | 147877 | 5.7% |
| O | 144031 | 5.6% |
| C | 141011 | 5.4% |
| Y | 131298 | 5.1% |
| X | 94876 | 3.7% |
| Other values (14) | 736977 |
city_pop
Real number (ℝ)
| Distinct | 879 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88824.441 |
| Minimum | 23 |
|---|---|
| Maximum | 2906700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 139 |
| Q1 | 743 |
| median | 2456 |
| Q3 | 20328 |
| 95-th percentile | 525713 |
| Maximum | 2906700 |
| Range | 2906677 |
| Interquartile range (IQR) | 19585 |
Descriptive statistics
| Standard deviation | 301956.36 |
|---|---|
| Coefficient of variation (CV) | 3.3994738 |
| Kurtosis | 37.614519 |
| Mean | 88824.441 |
| Median Absolute Deviation (MAD) | 2198 |
| Skewness | 5.5938531 |
| Sum | 1.1517643 × 1011 |
| Variance | 9.1177644 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 606 | 5496 | 0.4% |
| 1595797 | 5130 | 0.4% |
| 1312922 | 5075 | 0.4% |
| 1766 | 4574 | 0.4% |
| 241 | 4533 | 0.3% |
| 2906700 | 4168 | 0.3% |
| 276002 | 4155 | 0.3% |
| 302 | 4147 | 0.3% |
| 910148 | 4073 | 0.3% |
| 198 | 4067 | 0.3% |
| Other values (869) | 1251257 |
| Value | Count | Frequency (%) |
| 23 | 2049 | |
| 37 | 1013 | 0.1% |
| 43 | 2034 | |
| 46 | 3040 | |
| 47 | 511 | < 0.1% |
| 49 | 1054 | 0.1% |
| 51 | 1016 | 0.1% |
| 52 | 518 | < 0.1% |
| 53 | 2610 | |
| 60 | 1045 | 0.1% |
| Value | Count | Frequency (%) |
| 2906700 | 4168 | |
| 2504700 | 2033 | 0.2% |
| 2383912 | 521 | < 0.1% |
| 1595797 | 5130 | |
| 1577385 | 2563 | |
| 1526206 | 3517 | |
| 1417793 | 8 | < 0.1% |
| 1382480 | 2056 | |
| 1312922 | 5075 | |
| 1263321 | 3629 |
job
Text
| Distinct | 494 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 38 |
| Mean length | 20.227102 |
| Min length | 3 |
Characters and Unicode
| Total characters | 26227978 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Psychologist, counselling |
|---|---|
| 2nd row | Special educational needs teacher |
| 3rd row | Nature conservation officer |
| 4th row | Patent attorney |
| 5th row | Dance movement psychotherapist |
| Value | Count | Frequency (%) |
| engineer | 131756 | 4.6% |
| officer | 110915 | 3.9% |
| manager | 61124 | 2.1% |
| scientist | 55878 | 1.9% |
| designer | 52218 | 1.8% |
| surveyor | 49062 | 1.7% |
| teacher | 38126 | 1.3% |
| psychologist | 32600 | 1.1% |
| research | 29754 | 1.0% |
| editor | 28725 | 1.0% |
| Other values (456) | 2289024 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2803032 | 10.7% |
| i | 2386346 | 9.1% |
| r | 2198669 | 8.4% |
| a | 1813638 | 6.9% |
| t | 1782302 | 6.8% |
| n | 1764769 | 6.7% |
| 1582507 | 6.0% | |
| o | 1491775 | 5.7% |
| s | 1444701 | 5.5% |
| c | 1323152 | 5.0% |
| Other values (43) | 7637087 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22784440 | |
| Space Separator | 1582507 | 6.0% |
| Uppercase Letter | 1369269 | 5.2% |
| Other Punctuation | 443484 | 1.7% |
| Close Punctuation | 24139 | 0.1% |
| Open Punctuation | 24139 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2803032 | |
| i | 2386346 | |
| r | 2198669 | |
| a | 1813638 | 8.0% |
| t | 1782302 | 7.8% |
| n | 1764769 | 7.7% |
| o | 1491775 | 6.5% |
| s | 1444701 | 6.3% |
| c | 1323152 | 5.8% |
| l | 999624 | 4.4% |
| Other values (16) | 4776432 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 156704 | |
| E | 145426 | |
| P | 143111 | |
| S | 137500 | |
| T | 113148 | 8.3% |
| M | 89545 | 6.5% |
| A | 88466 | 6.5% |
| F | 68651 | 5.0% |
| D | 58034 | 4.2% |
| R | 55841 | 4.1% |
| Other values (11) | 312843 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 312210 | |
| / | 123567 | 27.9% |
| ' | 7707 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
| 1582507 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 24139 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 24139 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24153709 | |
| Common | 2074269 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2803032 | |
| i | 2386346 | 9.9% |
| r | 2198669 | 9.1% |
| a | 1813638 | 7.5% |
| t | 1782302 | 7.4% |
| n | 1764769 | 7.3% |
| o | 1491775 | 6.2% |
| s | 1444701 | 6.0% |
| c | 1323152 | 5.5% |
| l | 999624 | 4.1% |
| Other values (37) | 6145701 |
Common
| Value | Count | Frequency (%) |
| 1582507 | ||
| , | 312210 | 15.1% |
| / | 123567 | 6.0% |
| ) | 24139 | 1.2% |
| ( | 24139 | 1.2% |
| ' | 7707 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26227978 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2803032 | 10.7% |
| i | 2386346 | 9.1% |
| r | 2198669 | 8.4% |
| a | 1813638 | 6.9% |
| t | 1782302 | 6.8% |
| n | 1764769 | 6.7% |
| 1582507 | 6.0% | |
| o | 1491775 | 5.7% |
| s | 1444701 | 5.5% |
| c | 1323152 | 5.0% |
| Other values (43) | 7637087 |
unix_time
Real number (ℝ)
| Distinct | 1274823 |
|---|---|
| Distinct (%) | 98.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3492436 × 109 |
| Minimum | 1.325376 × 109 |
|---|---|
| Maximum | 1.3718168 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 1.325376 × 109 |
|---|---|
| 5-th percentile | 1.328672 × 109 |
| Q1 | 1.3387507 × 109 |
| median | 1.3492497 × 109 |
| Q3 | 1.3593854 × 109 |
| 95-th percentile | 1.3698306 × 109 |
| Maximum | 1.3718168 × 109 |
| Range | 46440799 |
| Interquartile range (IQR) | 20634633 |
Descriptive statistics
| Standard deviation | 12841278 |
|---|---|
| Coefficient of variation (CV) | 0.0095173904 |
| Kurtosis | -1.0875405 |
| Mean | 1.3492436 × 109 |
| Median Absolute Deviation (MAD) | 10358807 |
| Skewness | 0.0033779498 |
| Sum | 1.7495305 × 1015 |
| Variance | 1.6489843 × 1014 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1370177227 | 4 | < 0.1% |
| 1335110521 | 4 | < 0.1% |
| 1370050667 | 4 | < 0.1% |
| 1367602155 | 3 | < 0.1% |
| 1364686521 | 3 | < 0.1% |
| 1369587838 | 3 | < 0.1% |
| 1337306743 | 3 | < 0.1% |
| 1343668520 | 3 | < 0.1% |
| 1341944714 | 3 | < 0.1% |
| 1340650327 | 3 | < 0.1% |
| Other values (1274813) | 1296642 |
| Value | Count | Frequency (%) |
| 1325376018 | 1 | |
| 1325376044 | 1 | |
| 1325376051 | 1 | |
| 1325376076 | 1 | |
| 1325376186 | 1 | |
| 1325376248 | 1 | |
| 1325376282 | 1 | |
| 1325376308 | 1 | |
| 1325376318 | 1 | |
| 1325376361 | 1 |
| Value | Count | Frequency (%) |
| 1371816817 | 1 | |
| 1371816816 | 1 | |
| 1371816752 | 1 | |
| 1371816739 | 1 | |
| 1371816728 | 1 | |
| 1371816696 | 1 | |
| 1371816683 | 1 | |
| 1371816656 | 1 | |
| 1371816562 | 1 | |
| 1371816522 | 1 |
merch_lat
Real number (ℝ)
| Distinct | 1247805 |
|---|---|
| Distinct (%) | 96.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.537338 |
| Minimum | 19.027785 |
|---|---|
| Maximum | 67.510267 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 19.027785 |
|---|---|
| 5-th percentile | 29.751653 |
| Q1 | 34.733572 |
| median | 39.36568 |
| Q3 | 41.957164 |
| 95-th percentile | 46.00353 |
| Maximum | 67.510267 |
| Range | 48.482482 |
| Interquartile range (IQR) | 7.223592 |
Descriptive statistics
| Standard deviation | 5.1097884 |
|---|---|
| Coefficient of variation (CV) | 0.13259318 |
| Kurtosis | 0.79599391 |
| Mean | 38.537338 |
| Median Absolute Deviation (MAD) | 3.397536 |
| Skewness | -0.18191543 |
| Sum | 49970403 |
| Variance | 26.109937 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 41.305966 | 4 | < 0.1% |
| 41.937796 | 4 | < 0.1% |
| 42.265012 | 4 | < 0.1% |
| 41.301611 | 4 | < 0.1% |
| 34.134994 | 4 | < 0.1% |
| 37.669788 | 4 | < 0.1% |
| 39.348185 | 4 | < 0.1% |
| 32.64469 | 4 | < 0.1% |
| 42.749184 | 4 | < 0.1% |
| 38.050673 | 4 | < 0.1% |
| Other values (1247795) | 1296635 |
| Value | Count | Frequency (%) |
| 19.027785 | 1 | |
| 19.027804 | 1 | |
| 19.029798 | 1 | |
| 19.031242 | 1 | |
| 19.032277 | 1 | |
| 19.033288 | 1 | |
| 19.034282 | 1 | |
| 19.034687 | 1 | |
| 19.035472 | 1 | |
| 19.036312 | 1 |
| Value | Count | Frequency (%) |
| 67.510267 | 1 | |
| 67.441518 | 1 | |
| 67.397018 | 1 | |
| 67.188111 | 1 | |
| 67.064277 | 1 | |
| 66.835174 | 1 | |
| 66.682905 | 1 | |
| 66.67355 | 1 | |
| 66.664673 | 1 | |
| 66.659242 | 1 |
merch_long
Real number (ℝ)
| Distinct | 1275745 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -90.226465 |
| Minimum | -166.67124 |
|---|---|
| Maximum | -66.950902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1296675 |
| Negative (%) | 100.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | -166.67124 |
|---|---|
| 5-th percentile | -119.33009 |
| Q1 | -96.897276 |
| median | -87.438392 |
| Q3 | -80.236796 |
| 95-th percentile | -73.354218 |
| Maximum | -66.950902 |
| Range | 99.72034 |
| Interquartile range (IQR) | 16.660479 |
Descriptive statistics
| Standard deviation | 13.771091 |
|---|---|
| Coefficient of variation (CV) | -0.15262806 |
| Kurtosis | 1.8484792 |
| Mean | -90.226465 |
| Median Absolute Deviation (MAD) | 8.227889 |
| Skewness | -1.1469599 |
| Sum | -1.169944 × 108 |
| Variance | 189.64294 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -87.116414 | 4 | < 0.1% |
| -81.219189 | 4 | < 0.1% |
| -74.618269 | 4 | < 0.1% |
| -85.326323 | 3 | < 0.1% |
| -84.890305 | 3 | < 0.1% |
| -88.49309 | 3 | < 0.1% |
| -84.100102 | 3 | < 0.1% |
| -97.527227 | 3 | < 0.1% |
| -85.3444 | 3 | < 0.1% |
| -86.037494 | 3 | < 0.1% |
| Other values (1275735) | 1296642 |
| Value | Count | Frequency (%) |
| -166.671242 | 1 | |
| -166.670132 | 1 | |
| -166.669638 | 1 | |
| -166.666179 | 1 | |
| -166.664828 | 1 | |
| -166.662888 | 1 | |
| -166.661968 | 1 | |
| -166.659277 | 1 | |
| -166.657834 | 1 | |
| -166.657174 | 1 |
| Value | Count | Frequency (%) |
| -66.950902 | 1 | |
| -66.955996 | 1 | |
| -66.95654 | 1 | |
| -66.958659 | 1 | |
| -66.958751 | 1 | |
| -66.959178 | 1 | |
| -66.961923 | 1 | |
| -66.962913 | 1 | |
| -66.963918 | 1 | |
| -66.963975 | 1 |
is_fraud
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| 0 | |
|---|---|
| 1 | 7506 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1296675 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1289169 | |
| 1 | 7506 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1289169 | |
| 1 | 7506 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1289169 | |
| 1 | 7506 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1296675 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1289169 | |
| 1 | 7506 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1296675 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1289169 | |
| 1 | 7506 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1296675 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1289169 | |
| 1 | 7506 | 0.6% |
day
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.587978 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 15 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.8291214 |
|---|---|
| Coefficient of variation (CV) | 0.5664058 |
| Kurtosis | -1.1871417 |
| Mean | 15.587978 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.030847364 |
| Sum | 20212542 |
| Variance | 77.953384 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 47089 | 3.6% |
| 15 | 46213 | 3.6% |
| 8 | 46201 | 3.6% |
| 16 | 44894 | 3.5% |
| 2 | 44748 | 3.5% |
| 9 | 44685 | 3.4% |
| 7 | 44239 | 3.4% |
| 14 | 44015 | 3.4% |
| 28 | 43470 | 3.4% |
| 17 | 42272 | 3.3% |
| Other values (21) | 848849 |
| Value | Count | Frequency (%) |
| 1 | 47089 | |
| 2 | 44748 | |
| 3 | 41842 | |
| 4 | 41479 | |
| 5 | 41886 | |
| 6 | 41420 | |
| 7 | 44239 | |
| 8 | 46201 | |
| 9 | 44685 | |
| 10 | 41934 |
| Value | Count | Frequency (%) |
| 31 | 24701 | |
| 30 | 41019 | |
| 29 | 39617 | |
| 28 | 43470 | |
| 27 | 39684 | |
| 26 | 40692 | |
| 25 | 40374 | |
| 24 | 41360 | |
| 23 | 40815 | |
| 22 | 42061 |
day_of_week
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0706037 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 254282 |
| Zeros (%) | 19.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.1981526 |
|---|---|
| Coefficient of variation (CV) | 0.71586984 |
| Kurtosis | -1.445049 |
| Mean | 3.0706037 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.078453041 |
| Sum | 3981575 |
| Variance | 4.8318747 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 254282 | |
| 6 | 250579 | |
| 5 | 200957 | |
| 1 | 160227 | |
| 4 | 152272 | |
| 3 | 147285 | |
| 2 | 131073 |
| Value | Count | Frequency (%) |
| 0 | 254282 | |
| 1 | 160227 | |
| 2 | 131073 | |
| 3 | 147285 | |
| 4 | 152272 | |
| 5 | 200957 | |
| 6 | 250579 |
| Value | Count | Frequency (%) |
| 6 | 250579 | |
| 5 | 200957 | |
| 4 | 152272 | |
| 3 | 147285 | |
| 2 | 131073 | |
| 1 | 160227 | |
| 0 | 254282 |
age
Real number (ℝ)
| Distinct | 81 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.742545 |
| Minimum | 15 |
|---|---|
| Maximum | 96 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 33 |
| median | 45 |
| Q3 | 58 |
| 95-th percentile | 81 |
| Maximum | 96 |
| Range | 81 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 17.378485 |
|---|---|
| Coefficient of variation (CV) | 0.37179158 |
| Kurtosis | -0.1763463 |
| Mean | 46.742545 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.61235845 |
| Sum | 60609890 |
| Variance | 302.01173 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 48 | 45483 | 3.5% |
| 36 | 40038 | 3.1% |
| 33 | 37481 | 2.9% |
| 35 | 37313 | 2.9% |
| 46 | 34299 | 2.6% |
| 44 | 32706 | 2.5% |
| 34 | 31851 | 2.5% |
| 30 | 31386 | 2.4% |
| 47 | 31271 | 2.4% |
| 32 | 30718 | 2.4% |
| Other values (71) | 944129 |
| Value | Count | Frequency (%) |
| 15 | 1959 | 0.2% |
| 16 | 7496 | 0.6% |
| 17 | 3975 | 0.3% |
| 19 | 5603 | 0.4% |
| 20 | 9530 | 0.7% |
| 21 | 18827 | |
| 22 | 13241 | |
| 23 | 29689 | |
| 24 | 6008 | 0.5% |
| 25 | 20573 |
| Value | Count | Frequency (%) |
| 96 | 536 | < 0.1% |
| 95 | 11 | < 0.1% |
| 94 | 6063 | |
| 93 | 4645 | |
| 92 | 4131 | |
| 91 | 6224 | |
| 90 | 3605 | |
| 89 | 4610 | |
| 88 | 2096 | 0.2% |
| 87 | 3041 |
merchant_encoded
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 693 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 342.85849 |
| Minimum | 0 |
|---|---|
| Maximum | 692 |
| Zeros | 1844 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 165 |
| median | 346 |
| Q3 | 514 |
| 95-th percentile | 659 |
| Maximum | 692 |
| Range | 692 |
| Interquartile range (IQR) | 349 |
Descriptive statistics
| Standard deviation | 200.9519 |
|---|---|
| Coefficient of variation (CV) | 0.58610739 |
| Kurtosis | -1.215053 |
| Mean | 342.85849 |
| Median Absolute Deviation (MAD) | 175 |
| Skewness | 0.0086598641 |
| Sum | 4.4457604 × 108 |
| Variance | 40381.666 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 316 | 4403 | 0.3% |
| 105 | 3649 | 0.3% |
| 571 | 3634 | 0.3% |
| 349 | 3510 | 0.3% |
| 70 | 3493 | 0.3% |
| 136 | 3434 | 0.3% |
| 117 | 2736 | 0.2% |
| 358 | 2734 | 0.2% |
| 463 | 2723 | 0.2% |
| 607 | 2721 | 0.2% |
| Other values (683) | 1263638 |
| Value | Count | Frequency (%) |
| 0 | 1844 | |
| 1 | 1763 | |
| 2 | 1751 | |
| 3 | 1895 | |
| 4 | 940 | 0.1% |
| 5 | 1746 | |
| 6 | 1904 | |
| 7 | 2503 | |
| 8 | 1923 | |
| 9 | 821 | 0.1% |
| Value | Count | Frequency (%) |
| 692 | 1783 | |
| 691 | 2560 | |
| 690 | 1695 | |
| 689 | 1804 | |
| 688 | 1297 | |
| 687 | 2017 | |
| 686 | 1870 | |
| 685 | 1766 | |
| 684 | 1872 | |
| 683 | 2358 |
hour
Real number (ℝ)
ZEROS 
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.804858 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 42502 |
| Zeros (%) | 3.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 7 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 23 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 6.8178239 |
|---|---|
| Coefficient of variation (CV) | 0.53244042 |
| Kurtosis | -1.0795803 |
| Mean | 12.804858 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.28282545 |
| Sum | 16603739 |
| Variance | 46.482723 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 67104 | 5.2% |
| 22 | 66982 | 5.2% |
| 18 | 66051 | 5.1% |
| 16 | 65726 | 5.1% |
| 21 | 65533 | 5.1% |
| 19 | 65508 | 5.1% |
| 17 | 65450 | 5.0% |
| 15 | 65391 | 5.0% |
| 13 | 65314 | 5.0% |
| 12 | 65257 | 5.0% |
| Other values (14) | 638359 |
| Value | Count | Frequency (%) |
| 0 | 42502 | |
| 1 | 42869 | |
| 2 | 42656 | |
| 3 | 42769 | |
| 4 | 41863 | |
| 5 | 42171 | |
| 6 | 42300 | |
| 7 | 42203 | |
| 8 | 42505 | |
| 9 | 42185 |
| Value | Count | Frequency (%) |
| 23 | 67104 | |
| 22 | 66982 | |
| 21 | 65533 | |
| 20 | 65098 | |
| 19 | 65508 | |
| 18 | 66051 | |
| 17 | 65450 | |
| 16 | 65726 | |
| 15 | 65391 | |
| 14 | 64885 |
time_diff
Real number (ℝ)
| Distinct | 158975 |
|---|---|
| Distinct (%) | 12.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32460.389 |
| Minimum | 0 |
|---|---|
| Maximum | 1341471 |
| Zeros | 1003 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 969 |
| Q1 | 6004 |
| median | 16563 |
| Q3 | 40239 |
| 95-th percentile | 113905 |
| Maximum | 1341471 |
| Range | 1341471 |
| Interquartile range (IQR) | 34235 |
Descriptive statistics
| Standard deviation | 47331.145 |
|---|---|
| Coefficient of variation (CV) | 1.4581201 |
| Kurtosis | 31.873749 |
| Mean | 32460.389 |
| Median Absolute Deviation (MAD) | 12924 |
| Skewness | 4.2732438 |
| Sum | 4.2090574 × 1010 |
| Variance | 2.2402373 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1003 | 0.1% |
| 221 | 91 | < 0.1% |
| 290 | 90 | < 0.1% |
| 118 | 90 | < 0.1% |
| 445 | 89 | < 0.1% |
| 572 | 88 | < 0.1% |
| 821 | 87 | < 0.1% |
| 11 | 87 | < 0.1% |
| 136 | 86 | < 0.1% |
| 379 | 86 | < 0.1% |
| Other values (158965) | 1294878 |
| Value | Count | Frequency (%) |
| 0 | 1003 | |
| 1 | 62 | < 0.1% |
| 2 | 81 | < 0.1% |
| 3 | 76 | < 0.1% |
| 4 | 57 | < 0.1% |
| 5 | 67 | < 0.1% |
| 6 | 67 | < 0.1% |
| 7 | 64 | < 0.1% |
| 8 | 72 | < 0.1% |
| 9 | 72 | < 0.1% |
| Value | Count | Frequency (%) |
| 1341471 | 1 | |
| 1205687 | 1 | |
| 1107569 | 1 | |
| 1096094 | 1 | |
| 1060731 | 1 | |
| 1053269 | 1 | |
| 1045690 | 1 | |
| 1039152 | 1 | |
| 1032247 | 1 | |
| 1016241 | 1 |
day_of_month
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.589412 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 15 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.8312176 |
|---|---|
| Coefficient of variation (CV) | 0.56648818 |
| Kurtosis | -1.1868502 |
| Mean | 15.589412 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.031380011 |
| Sum | 20214401 |
| Variance | 77.990405 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 47089 | 3.6% |
| 15 | 46213 | 3.6% |
| 8 | 46201 | 3.6% |
| 16 | 44894 | 3.5% |
| 2 | 44748 | 3.5% |
| 9 | 44685 | 3.4% |
| 7 | 44239 | 3.4% |
| 14 | 44015 | 3.4% |
| 17 | 42272 | 3.3% |
| 22 | 42061 | 3.2% |
| Other values (21) | 850258 |
| Value | Count | Frequency (%) |
| 1 | 47089 | |
| 2 | 44748 | |
| 3 | 41842 | |
| 4 | 41479 | |
| 5 | 41886 | |
| 6 | 41420 | |
| 7 | 44239 | |
| 8 | 46201 | |
| 9 | 44685 | |
| 10 | 41934 |
| Value | Count | Frequency (%) |
| 31 | 24701 | |
| 30 | 41019 | |
| 29 | 41476 | |
| 28 | 41611 | |
| 27 | 39684 | |
| 26 | 40692 | |
| 25 | 40374 | |
| 24 | 41360 | |
| 23 | 40815 | |
| 22 | 42061 |
is_weekend
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1296675 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 845139 | |
| 1 | 451536 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 845139 | |
| 1 | 451536 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 845139 | |
| 1 | 451536 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1296675 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 845139 | |
| 1 | 451536 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1296675 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 845139 | |
| 1 | 451536 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1296675 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 845139 | |
| 1 | 451536 |
amt_ratio
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 1166530 |
|---|---|
| Distinct (%) | 90.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1 |
| Minimum | 0.0086933687 |
|---|---|
| Maximum | 386.48159 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 0.0086933687 |
|---|---|
| 5-th percentile | 0.037613054 |
| Q1 | 0.15441537 |
| median | 0.66941509 |
| Q3 | 1.187412 |
| 95-th percentile | 2.620873 |
| Maximum | 386.48159 |
| Range | 386.47289 |
| Interquartile range (IQR) | 1.0329966 |
Descriptive statistics
| Standard deviation | 2.2999175 |
|---|---|
| Coefficient of variation (CV) | 2.2999175 |
| Kurtosis | 2845.425 |
| Mean | 1 |
| Median Absolute Deviation (MAD) | 0.51563683 |
| Skewness | 34.809554 |
| Sum | 1296675 |
| Variance | 5.2896205 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.1697644044 | 9 | < 0.1% |
| 0.1066987452 | 9 | < 0.1% |
| 0.08787319111 | 8 | < 0.1% |
| 0.0790465957 | 8 | < 0.1% |
| 0.1549432174 | 8 | < 0.1% |
| 0.04983253947 | 7 | < 0.1% |
| 0.08567179375 | 7 | < 0.1% |
| 0.05196871342 | 7 | < 0.1% |
| 0.07718171096 | 7 | < 0.1% |
| 0.02356862305 | 7 | < 0.1% |
| Other values (1166520) | 1296598 |
| Value | Count | Frequency (%) |
| 0.008693368698 | 1 | |
| 0.00877907162 | 1 | |
| 0.00929940348 | 1 | |
| 0.009387522129 | 1 | |
| 0.009534706331 | 1 | |
| 0.009548024185 | 2 | |
| 0.009568974574 | 1 | |
| 0.009641632266 | 1 | |
| 0.009828848426 | 1 | |
| 0.009856043811 | 1 |
| Value | Count | Frequency (%) |
| 386.4815883 | 1 | |
| 358.5962394 | 1 | |
| 283.0207865 | 1 | |
| 248.4744115 | 1 | |
| 236.1516822 | 1 | |
| 235.729112 | 1 | |
| 235.4992914 | 1 | |
| 233.4279738 | 1 | |
| 219.4420112 | 1 | |
| 217.5342348 | 1 |
high_value
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.9 MiB |
| 0 | |
|---|---|
| 1 | 64822 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1296675 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1231853 | |
| 1 | 64822 | 5.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1231853 | |
| 1 | 64822 | 5.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1231853 | |
| 1 | 64822 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1296675 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1231853 | |
| 1 | 64822 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1296675 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1231853 | |
| 1 | 64822 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1296675 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1231853 | |
| 1 | 64822 | 5.0% |
category_food_dining
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 91461 |
| Value | Count | Frequency (%) |
| False | 1205214 | |
| True | 91461 | 7.1% |
category_gas_transport
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 1165016 | |
| True | 131659 | 10.2% |
category_grocery_net
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 45452 |
| Value | Count | Frequency (%) |
| False | 1251223 | |
| True | 45452 | 3.5% |
category_grocery_pos
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 1173037 | |
| True | 123638 | 9.5% |
category_health_fitness
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 85879 |
| Value | Count | Frequency (%) |
| False | 1210796 | |
| True | 85879 | 6.6% |
category_home
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 1173560 | |
| True | 123115 | 9.5% |
category_kids_pets
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 113035 |
| Value | Count | Frequency (%) |
| False | 1183640 | |
| True | 113035 | 8.7% |
category_misc_net
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 63287 |
| Value | Count | Frequency (%) |
| False | 1233388 | |
| True | 63287 | 4.9% |
category_misc_pos
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 79655 |
| Value | Count | Frequency (%) |
| False | 1217020 | |
| True | 79655 | 6.1% |
category_personal_care
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 90758 |
| Value | Count | Frequency (%) |
| False | 1205917 | |
| True | 90758 | 7.0% |
category_shopping_net
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 97543 |
| Value | Count | Frequency (%) |
| False | 1199132 | |
| True | 97543 | 7.5% |
category_shopping_pos
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 116672 |
| Value | Count | Frequency (%) |
| False | 1180003 | |
| True | 116672 | 9.0% |
category_travel
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| False | |
|---|---|
| True | 40507 |
| Value | Count | Frequency (%) |
| False | 1256168 | |
| True | 40507 | 3.1% |
amt_merchant_interaction
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 850172 |
|---|---|
| Distinct (%) | 65.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24035.621 |
| Minimum | 0 |
|---|---|
| Maximum | 15748202 |
| Zeros | 1844 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 382.3 |
| Q1 | 2552.1 |
| median | 10352.25 |
| Q3 | 29331.085 |
| 95-th percentile | 77275.25 |
| Maximum | 15748202 |
| Range | 15748202 |
| Interquartile range (IQR) | 26778.985 |
Descriptive statistics
| Standard deviation | 64788.516 |
|---|---|
| Coefficient of variation (CV) | 2.6955208 |
| Kurtosis | 6863.1684 |
| Mean | 24035.621 |
| Median Absolute Deviation (MAD) | 9173.73 |
| Skewness | 50.346708 |
| Sum | 3.1166389 × 1010 |
| Variance | 4.1975518 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1844 | 0.1% |
| 672 | 36 | < 0.1% |
| 1224 | 36 | < 0.1% |
| 756 | 35 | < 0.1% |
| 693 | 32 | < 0.1% |
| 1344 | 31 | < 0.1% |
| 1056 | 31 | < 0.1% |
| 1512 | 31 | < 0.1% |
| 1008 | 30 | < 0.1% |
| 315 | 30 | < 0.1% |
| Other values (850162) | 1294539 |
| Value | Count | Frequency (%) |
| 0 | 1844 | |
| 1 | 1 | < 0.1% |
| 1.01 | 1 | < 0.1% |
| 1.03 | 1 | < 0.1% |
| 1.04 | 1 | < 0.1% |
| 1.05 | 2 | < 0.1% |
| 1.06 | 2 | < 0.1% |
| 1.1 | 1 | < 0.1% |
| 1.11 | 3 | < 0.1% |
| 1.13 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 15748201.6 | 1 | |
| 11832531.84 | 1 | |
| 11715740.64 | 1 | |
| 11114186.04 | 1 | |
| 7866051.3 | 1 | |
| 7668280.95 | 1 | |
| 7448548.25 | 1 | |
| 7414617.32 | 1 | |
| 7126857.95 | 1 | |
| 6743415.28 | 1 |
amt_day_interaction
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 99344 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 215.66738 |
| Minimum | 0 |
|---|---|
| Maximum | 173693.4 |
| Zeros | 254282 |
| Zeros (%) | 19.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 8.25 |
| median | 73.29 |
| Q3 | 269.75 |
| 95-th percentile | 741.42 |
| Maximum | 173693.4 |
| Range | 173693.4 |
| Interquartile range (IQR) | 261.5 |
Descriptive statistics
| Standard deviation | 631.42542 |
|---|---|
| Coefficient of variation (CV) | 2.9277743 |
| Kurtosis | 10492.695 |
| Mean | 215.66738 |
| Median Absolute Deviation (MAD) | 73.29 |
| Skewness | 62.287145 |
| Sum | 2.796505 × 108 |
| Variance | 398698.06 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 254282 | 19.6% |
| 9 | 355 | < 0.1% |
| 7.5 | 334 | < 0.1% |
| 10.2 | 314 | < 0.1% |
| 12 | 313 | < 0.1% |
| 10.8 | 311 | < 0.1% |
| 6.24 | 301 | < 0.1% |
| 6 | 296 | < 0.1% |
| 9.48 | 291 | < 0.1% |
| 15 | 288 | < 0.1% |
| Other values (99334) | 1039590 |
| Value | Count | Frequency (%) |
| 0 | 254282 | |
| 1 | 36 | < 0.1% |
| 1.01 | 67 | < 0.1% |
| 1.02 | 62 | < 0.1% |
| 1.03 | 70 | < 0.1% |
| 1.04 | 53 | < 0.1% |
| 1.05 | 59 | < 0.1% |
| 1.06 | 60 | < 0.1% |
| 1.07 | 55 | < 0.1% |
| 1.08 | 65 | < 0.1% |
| Value | Count | Frequency (%) |
| 173693.4 | 1 | |
| 135598.85 | 1 | |
| 132720.6 | 1 | |
| 107383.44 | 1 | |
| 100347.76 | 1 | |
| 90282.18 | 1 | |
| 76725.06 | 1 | |
| 72338.2 | 1 | |
| 72036.25 | 1 | |
| 67684.2 | 1 |
amt_mean
Real number (ℝ)
| Distinct | 983 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.351035 |
| Minimum | 42.951671 |
|---|---|
| Maximum | 948.81818 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 42.951671 |
|---|---|
| 5-th percentile | 52.795807 |
| Q1 | 59.813649 |
| median | 65.09374 |
| Q3 | 83.277582 |
| 95-th percentile | 96.281225 |
| Maximum | 948.81818 |
| Range | 905.86651 |
| Interquartile range (IQR) | 23.463933 |
Descriptive statistics
| Standard deviation | 19.410291 |
|---|---|
| Coefficient of variation (CV) | 0.27590625 |
| Kurtosis | 459.14005 |
| Mean | 70.351035 |
| Median Absolute Deviation (MAD) | 7.1445792 |
| Skewness | 14.40415 |
| Sum | 91222429 |
| Variance | 376.75938 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 61.45309958 | 3123 | 0.2% |
| 55.35335575 | 3123 | 0.2% |
| 89.77494389 | 3119 | 0.2% |
| 57.5451941 | 3117 | 0.2% |
| 48.47894957 | 3113 | 0.2% |
| 52.44669344 | 3112 | 0.2% |
| 95.31727653 | 3110 | 0.2% |
| 52.78476344 | 3107 | 0.2% |
| 91.44027688 | 3106 | 0.2% |
| 89.6934118 | 3101 | 0.2% |
| Other values (973) | 1265544 |
| Value | Count | Frequency (%) |
| 42.951671 | 1538 | |
| 44.71734531 | 501 | < 0.1% |
| 46.88654145 | 1544 | |
| 46.93667091 | 1571 | |
| 46.98265945 | 2587 | |
| 47.17054602 | 2564 | |
| 47.18054799 | 3011 | |
| 47.94510046 | 2588 | |
| 48.47894957 | 3113 | |
| 48.48821547 | 2107 |
| Value | Count | Frequency (%) |
| 948.8181818 | 11 | |
| 918.4255556 | 9 | |
| 874.5057143 | 7 | |
| 858.48 | 8 | |
| 842.23125 | 8 | |
| 833.969 | 10 | |
| 810.2785714 | 7 | |
| 799.2133333 | 9 | |
| 778.5718182 | 11 | |
| 774.74375 | 8 |
amt_std
Real number (ℝ)
| Distinct | 983 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 141.34551 |
| Minimum | 60.247108 |
|---|---|
| Maximum | 1202.988 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.9 MiB |
Quantile statistics
| Minimum | 60.247108 |
|---|---|
| 5-th percentile | 83.708636 |
| Q1 | 103.59066 |
| median | 122.37808 |
| Q3 | 149.71688 |
| 95-th percentile | 266.56177 |
| Maximum | 1202.988 |
| Range | 1142.7409 |
| Interquartile range (IQR) | 46.126219 |
Descriptive statistics
| Standard deviation | 73.303097 |
|---|---|
| Coefficient of variation (CV) | 0.51860931 |
| Kurtosis | 46.700004 |
| Mean | 141.34551 |
| Median Absolute Deviation (MAD) | 21.853523 |
| Skewness | 5.0391525 |
| Sum | 1.8327919 × 108 |
| Variance | 5373.344 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 143.1626102 | 3123 | 0.2% |
| 121.6611149 | 3123 | 0.2% |
| 118.4502102 | 3119 | 0.2% |
| 211.4021978 | 3117 | 0.2% |
| 131.566918 | 3113 | 0.2% |
| 155.4128126 | 3112 | 0.2% |
| 216.5906992 | 3110 | 0.2% |
| 140.3830389 | 3107 | 0.2% |
| 132.7622028 | 3106 | 0.2% |
| 137.017678 | 3101 | 0.2% |
| Other values (973) | 1265544 |
| Value | Count | Frequency (%) |
| 60.24710813 | 504 | < 0.1% |
| 64.15872518 | 471 | < 0.1% |
| 65.34697836 | 518 | < 0.1% |
| 65.53295778 | 972 | |
| 65.84396856 | 496 | < 0.1% |
| 66.98899721 | 525 | < 0.1% |
| 67.49926144 | 1494 | |
| 70.82320784 | 1005 | |
| 71.420119 | 485 | < 0.1% |
| 72.37512206 | 509 | < 0.1% |
| Value | Count | Frequency (%) |
| 1202.988005 | 510 | < 0.1% |
| 1165.824421 | 520 | < 0.1% |
| 867.2878545 | 1017 | 0.1% |
| 644.2690528 | 2050 | |
| 623.2522115 | 540 | < 0.1% |
| 543.2532612 | 13 | < 0.1% |
| 512.7832113 | 550 | < 0.1% |
| 510.8874515 | 2597 | |
| 483.3276455 | 1060 | |
| 482.145942 | 10 | < 0.1% |
| age | amt | amt_day_interaction | amt_mean | amt_merchant_interaction | amt_ratio | amt_std | category_food_dining | category_gas_transport | category_grocery_net | category_grocery_pos | category_health_fitness | category_home | category_kids_pets | category_misc_net | category_misc_pos | category_personal_care | category_shopping_net | category_shopping_pos | category_travel | city_pop | day | day_of_month | day_of_week | gender | high_value | hour | is_fraud | is_weekend | merch_lat | merch_long | merchant_encoded | time_diff | unix_time | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | -0.024 | -0.022 | 0.009 | -0.025 | -0.006 | -0.058 | 0.025 | 0.065 | 0.087 | 0.047 | 0.010 | 0.044 | 0.023 | 0.021 | 0.032 | 0.023 | 0.019 | 0.021 | 0.027 | -0.157 | 0.001 | 0.001 | -0.013 | 0.132 | 0.043 | -0.173 | 0.020 | 0.046 | 0.036 | -0.020 | -0.007 | 0.125 | -0.004 |
| amt | -0.024 | 1.000 | 0.579 | 0.227 | 0.811 | 0.979 | -0.086 | 0.004 | 0.005 | 0.002 | 0.005 | 0.004 | 0.005 | 0.005 | 0.003 | 0.004 | 0.004 | 0.009 | 0.011 | 0.053 | -0.024 | 0.000 | 0.000 | -0.001 | 0.000 | 0.074 | -0.154 | 0.000 | 0.001 | 0.012 | 0.000 | -0.012 | 0.030 | 0.001 |
| amt_day_interaction | -0.022 | 0.579 | 1.000 | 0.131 | 0.479 | 0.570 | -0.051 | 0.001 | 0.002 | 0.000 | 0.002 | 0.001 | 0.002 | 0.002 | 0.000 | 0.001 | 0.001 | 0.003 | 0.005 | 0.038 | -0.015 | 0.018 | 0.018 | 0.684 | 0.001 | 0.046 | -0.088 | 0.000 | 0.008 | 0.008 | 0.001 | -0.006 | 0.062 | -0.018 |
| amt_mean | 0.009 | 0.227 | 0.131 | 1.000 | 0.188 | 0.057 | 0.054 | 0.005 | 0.004 | 0.002 | 0.015 | 0.006 | 0.007 | 0.006 | 0.012 | 0.003 | 0.005 | 0.021 | 0.003 | 0.001 | 0.097 | -0.000 | -0.000 | -0.003 | 0.010 | 0.085 | -0.044 | 0.313 | 0.005 | -0.025 | 0.007 | 0.004 | 0.064 | 0.002 |
| amt_merchant_interaction | -0.025 | 0.811 | 0.479 | 0.188 | 1.000 | 0.798 | -0.069 | 0.003 | 0.003 | 0.001 | 0.003 | 0.002 | 0.003 | 0.003 | 0.000 | 0.002 | 0.003 | 0.007 | 0.006 | 0.039 | -0.017 | 0.000 | 0.000 | -0.000 | 0.000 | 0.053 | -0.134 | 0.000 | 0.000 | 0.009 | -0.001 | 0.500 | 0.026 | -0.000 |
| amt_ratio | -0.006 | 0.979 | 0.570 | 0.057 | 0.798 | 1.000 | -0.076 | 0.005 | 0.006 | 0.003 | 0.006 | 0.005 | 0.006 | 0.006 | 0.002 | 0.004 | 0.005 | 0.014 | 0.016 | 0.052 | -0.027 | 0.000 | 0.000 | 0.000 | 0.000 | 0.086 | -0.166 | 0.000 | 0.001 | 0.011 | -0.001 | -0.013 | 0.025 | 0.000 |
| amt_std | -0.058 | -0.086 | -0.051 | 0.054 | -0.069 | -0.076 | 1.000 | 0.003 | 0.014 | 0.011 | 0.005 | 0.005 | 0.005 | 0.004 | 0.003 | 0.004 | 0.003 | 0.004 | 0.005 | 0.004 | 0.229 | 0.001 | 0.001 | -0.002 | 0.112 | 0.008 | 0.038 | 0.044 | 0.003 | -0.069 | 0.037 | 0.002 | -0.040 | -0.003 |
| category_food_dining | 0.025 | 0.004 | 0.001 | 0.005 | 0.003 | 0.005 | 0.003 | 1.000 | 0.093 | 0.052 | 0.089 | 0.073 | 0.089 | 0.085 | 0.062 | 0.070 | 0.076 | 0.079 | 0.087 | 0.049 | 0.005 | 0.000 | 0.000 | 0.003 | 0.010 | 0.042 | 0.160 | 0.015 | 0.001 | 0.002 | 0.007 | 0.105 | 0.001 | 0.003 |
| category_gas_transport | 0.065 | 0.005 | 0.002 | 0.004 | 0.003 | 0.006 | 0.014 | 0.093 | 1.000 | 0.064 | 0.109 | 0.090 | 0.109 | 0.104 | 0.076 | 0.086 | 0.092 | 0.096 | 0.106 | 0.060 | 0.026 | 0.001 | 0.001 | 0.003 | 0.003 | 0.077 | 0.419 | 0.005 | 0.003 | 0.021 | 0.014 | 0.123 | 0.000 | 0.000 |
| category_grocery_net | 0.087 | 0.002 | 0.000 | 0.002 | 0.001 | 0.003 | 0.011 | 0.052 | 0.064 | 1.000 | 0.062 | 0.051 | 0.062 | 0.059 | 0.043 | 0.049 | 0.052 | 0.054 | 0.060 | 0.034 | 0.018 | 0.001 | 0.001 | 0.002 | 0.003 | 0.044 | 0.237 | 0.007 | 0.000 | 0.017 | 0.012 | 0.068 | 0.007 | 0.000 |
| category_grocery_pos | 0.047 | 0.005 | 0.002 | 0.015 | 0.003 | 0.006 | 0.005 | 0.089 | 0.109 | 0.062 | 1.000 | 0.086 | 0.105 | 0.100 | 0.074 | 0.083 | 0.089 | 0.093 | 0.102 | 0.058 | 0.007 | 0.000 | 0.000 | 0.003 | 0.012 | 0.083 | 0.391 | 0.036 | 0.002 | 0.003 | 0.005 | 0.100 | 0.000 | 0.003 |
| category_health_fitness | 0.010 | 0.004 | 0.001 | 0.006 | 0.002 | 0.005 | 0.005 | 0.073 | 0.090 | 0.051 | 0.086 | 1.000 | 0.086 | 0.082 | 0.060 | 0.068 | 0.073 | 0.076 | 0.084 | 0.048 | 0.003 | 0.001 | 0.001 | 0.001 | 0.011 | 0.038 | 0.214 | 0.015 | 0.002 | 0.005 | 0.004 | 0.140 | 0.000 | 0.000 |
| category_home | 0.044 | 0.005 | 0.002 | 0.007 | 0.003 | 0.006 | 0.005 | 0.089 | 0.109 | 0.062 | 0.105 | 0.086 | 1.000 | 0.100 | 0.073 | 0.083 | 0.089 | 0.092 | 0.102 | 0.058 | 0.006 | 0.001 | 0.000 | 0.004 | 0.011 | 0.044 | 0.260 | 0.018 | 0.004 | 0.004 | 0.003 | 0.079 | 0.000 | 0.001 |
| category_kids_pets | 0.023 | 0.005 | 0.002 | 0.006 | 0.003 | 0.006 | 0.004 | 0.085 | 0.104 | 0.059 | 0.100 | 0.082 | 0.100 | 1.000 | 0.070 | 0.079 | 0.085 | 0.088 | 0.097 | 0.055 | 0.000 | 0.001 | 0.001 | 0.002 | 0.005 | 0.042 | 0.248 | 0.015 | 0.002 | 0.003 | 0.001 | 0.137 | 0.000 | 0.000 |
| category_misc_net | 0.021 | 0.003 | 0.000 | 0.012 | 0.000 | 0.002 | 0.003 | 0.062 | 0.076 | 0.043 | 0.074 | 0.060 | 0.073 | 0.070 | 1.000 | 0.058 | 0.062 | 0.065 | 0.071 | 0.041 | 0.004 | 0.002 | 0.002 | 0.002 | 0.007 | 0.059 | 0.196 | 0.026 | 0.000 | 0.005 | 0.005 | 0.101 | 0.002 | 0.002 |
| category_misc_pos | 0.032 | 0.004 | 0.001 | 0.003 | 0.002 | 0.004 | 0.004 | 0.070 | 0.086 | 0.049 | 0.083 | 0.068 | 0.083 | 0.079 | 0.058 | 1.000 | 0.070 | 0.073 | 0.080 | 0.046 | 0.003 | 0.000 | 0.000 | 0.004 | 0.008 | 0.040 | 0.094 | 0.009 | 0.004 | 0.004 | 0.008 | 0.139 | 0.002 | 0.000 |
| category_personal_care | 0.023 | 0.004 | 0.001 | 0.005 | 0.003 | 0.005 | 0.003 | 0.076 | 0.092 | 0.052 | 0.089 | 0.073 | 0.089 | 0.085 | 0.062 | 0.070 | 1.000 | 0.078 | 0.086 | 0.049 | 0.002 | 0.001 | 0.000 | 0.002 | 0.034 | 0.040 | 0.220 | 0.012 | 0.001 | 0.004 | 0.001 | 0.071 | 0.000 | 0.000 |
| category_shopping_net | 0.019 | 0.009 | 0.003 | 0.021 | 0.007 | 0.014 | 0.004 | 0.079 | 0.096 | 0.054 | 0.093 | 0.076 | 0.092 | 0.088 | 0.065 | 0.073 | 0.078 | 1.000 | 0.090 | 0.051 | 0.009 | 0.002 | 0.002 | 0.000 | 0.011 | 0.064 | 0.019 | 0.044 | 0.000 | 0.003 | 0.006 | 0.098 | 0.003 | 0.001 |
| category_shopping_pos | 0.021 | 0.011 | 0.005 | 0.003 | 0.006 | 0.016 | 0.005 | 0.087 | 0.106 | 0.060 | 0.102 | 0.084 | 0.102 | 0.097 | 0.071 | 0.080 | 0.086 | 0.090 | 1.000 | 0.056 | 0.011 | 0.002 | 0.002 | 0.000 | 0.021 | 0.047 | 0.003 | 0.006 | 0.000 | 0.009 | 0.005 | 0.132 | 0.002 | 0.000 |
| category_travel | 0.027 | 0.053 | 0.038 | 0.001 | 0.039 | 0.052 | 0.004 | 0.049 | 0.060 | 0.034 | 0.058 | 0.048 | 0.058 | 0.055 | 0.041 | 0.046 | 0.049 | 0.051 | 0.056 | 1.000 | 0.006 | 0.000 | 0.000 | 0.003 | 0.018 | 0.067 | 0.144 | 0.007 | 0.002 | 0.005 | 0.004 | 0.059 | 0.003 | 0.000 |
| city_pop | -0.157 | -0.024 | -0.015 | 0.097 | -0.017 | -0.027 | 0.229 | 0.005 | 0.026 | 0.018 | 0.007 | 0.003 | 0.006 | 0.000 | 0.004 | 0.003 | 0.002 | 0.009 | 0.011 | 0.006 | 1.000 | -0.001 | -0.001 | 0.002 | 0.089 | 0.032 | 0.033 | 0.004 | 0.008 | -0.264 | 0.086 | 0.004 | -0.009 | -0.003 |
| day | 0.001 | 0.000 | 0.018 | -0.000 | 0.000 | 0.000 | 0.001 | 0.000 | 0.001 | 0.001 | 0.000 | 0.001 | 0.001 | 0.001 | 0.002 | 0.000 | 0.001 | 0.002 | 0.002 | 0.000 | -0.001 | 1.000 | 1.000 | 0.017 | 0.000 | 0.002 | -0.000 | 0.009 | 0.070 | -0.000 | 0.000 | -0.001 | -0.002 | 0.019 |
| day_of_month | 0.001 | 0.000 | 0.018 | -0.000 | 0.000 | 0.000 | 0.001 | 0.000 | 0.001 | 0.001 | 0.000 | 0.001 | 0.000 | 0.001 | 0.002 | 0.000 | 0.000 | 0.002 | 0.002 | 0.000 | -0.001 | 1.000 | 1.000 | 0.017 | 0.000 | 0.002 | -0.000 | 0.009 | 0.069 | -0.000 | 0.000 | -0.001 | -0.002 | 0.019 |
| day_of_week | -0.013 | -0.001 | 0.684 | -0.003 | -0.000 | 0.000 | -0.002 | 0.003 | 0.003 | 0.002 | 0.003 | 0.001 | 0.004 | 0.002 | 0.002 | 0.004 | 0.002 | 0.000 | 0.000 | 0.003 | 0.002 | 0.017 | 0.017 | 1.000 | 0.006 | 0.002 | 0.000 | 0.012 | 1.000 | 0.000 | 0.001 | 0.001 | -0.011 | -0.029 |
| gender | 0.132 | 0.000 | 0.001 | 0.010 | 0.000 | 0.000 | 0.112 | 0.010 | 0.003 | 0.003 | 0.012 | 0.011 | 0.011 | 0.005 | 0.007 | 0.008 | 0.034 | 0.011 | 0.021 | 0.018 | 0.089 | 0.000 | 0.000 | 0.006 | 1.000 | 0.047 | 0.045 | 0.008 | 0.004 | 0.103 | 0.082 | 0.006 | 0.031 | 0.000 |
| high_value | 0.043 | 0.074 | 0.046 | 0.085 | 0.053 | 0.086 | 0.008 | 0.042 | 0.077 | 0.044 | 0.083 | 0.038 | 0.044 | 0.042 | 0.059 | 0.040 | 0.040 | 0.064 | 0.047 | 0.067 | 0.032 | 0.002 | 0.002 | 0.002 | 0.047 | 1.000 | 0.030 | 0.249 | 0.001 | 0.022 | 0.013 | 0.023 | 0.007 | 0.003 |
| hour | -0.173 | -0.154 | -0.088 | -0.044 | -0.134 | -0.166 | 0.038 | 0.160 | 0.419 | 0.237 | 0.391 | 0.214 | 0.260 | 0.248 | 0.196 | 0.094 | 0.220 | 0.019 | 0.003 | 0.144 | 0.033 | -0.000 | -0.000 | 0.000 | 0.045 | 0.030 | 1.000 | 0.095 | 0.000 | -0.010 | -0.006 | -0.002 | -0.120 | 0.001 |
| is_fraud | 0.020 | 0.000 | 0.000 | 0.313 | 0.000 | 0.000 | 0.044 | 0.015 | 0.005 | 0.007 | 0.036 | 0.015 | 0.018 | 0.015 | 0.026 | 0.009 | 0.012 | 0.044 | 0.006 | 0.007 | 0.004 | 0.009 | 0.009 | 0.012 | 0.008 | 0.249 | 0.095 | 1.000 | 0.004 | 0.008 | 0.005 | 0.010 | 0.010 | 0.018 |
| is_weekend | 0.046 | 0.001 | 0.008 | 0.005 | 0.000 | 0.001 | 0.003 | 0.001 | 0.003 | 0.000 | 0.002 | 0.002 | 0.004 | 0.002 | 0.000 | 0.004 | 0.001 | 0.000 | 0.000 | 0.002 | 0.008 | 0.070 | 0.069 | 1.000 | 0.004 | 0.001 | 0.000 | 0.004 | 1.000 | 0.004 | 0.005 | 0.001 | 0.025 | 0.083 |
| merch_lat | 0.036 | 0.012 | 0.008 | -0.025 | 0.009 | 0.011 | -0.069 | 0.002 | 0.021 | 0.017 | 0.003 | 0.005 | 0.004 | 0.003 | 0.005 | 0.004 | 0.004 | 0.003 | 0.009 | 0.005 | -0.264 | -0.000 | -0.000 | 0.000 | 0.103 | 0.022 | -0.010 | 0.008 | 0.004 | 1.000 | 0.104 | -0.002 | 0.012 | 0.001 |
| merch_long | -0.020 | 0.000 | 0.001 | 0.007 | -0.001 | -0.001 | 0.037 | 0.007 | 0.014 | 0.012 | 0.005 | 0.004 | 0.003 | 0.001 | 0.005 | 0.008 | 0.001 | 0.006 | 0.005 | 0.004 | 0.086 | 0.000 | 0.000 | 0.001 | 0.082 | 0.013 | -0.006 | 0.005 | 0.005 | 0.104 | 1.000 | -0.001 | 0.006 | -0.001 |
| merchant_encoded | -0.007 | -0.012 | -0.006 | 0.004 | 0.500 | -0.013 | 0.002 | 0.105 | 0.123 | 0.068 | 0.100 | 0.140 | 0.079 | 0.137 | 0.101 | 0.139 | 0.071 | 0.098 | 0.132 | 0.059 | 0.004 | -0.001 | -0.001 | 0.001 | 0.006 | 0.023 | -0.002 | 0.010 | 0.001 | -0.002 | -0.001 | 1.000 | -0.000 | -0.001 |
| time_diff | 0.125 | 0.030 | 0.062 | 0.064 | 0.026 | 0.025 | -0.040 | 0.001 | 0.000 | 0.007 | 0.000 | 0.000 | 0.000 | 0.000 | 0.002 | 0.002 | 0.000 | 0.003 | 0.002 | 0.003 | -0.009 | -0.002 | -0.002 | -0.011 | 0.031 | 0.007 | -0.120 | 0.010 | 0.025 | 0.012 | 0.006 | -0.000 | 1.000 | -0.035 |
| unix_time | -0.004 | 0.001 | -0.018 | 0.002 | -0.000 | 0.000 | -0.003 | 0.003 | 0.000 | 0.000 | 0.003 | 0.000 | 0.001 | 0.000 | 0.002 | 0.000 | 0.000 | 0.001 | 0.000 | 0.000 | -0.003 | 0.019 | 0.019 | -0.029 | 0.000 | 0.003 | 0.001 | 0.018 | 0.083 | 0.001 | -0.001 | -0.001 | -0.035 | 1.000 |
| amt | first | last | gender | street | city | state | city_pop | job | unix_time | merch_lat | merch_long | is_fraud | day | day_of_week | age | merchant_encoded | hour | time_diff | day_of_month | is_weekend | amt_ratio | high_value | category_food_dining | category_gas_transport | category_grocery_net | category_grocery_pos | category_health_fitness | category_home | category_kids_pets | category_misc_net | category_misc_pos | category_personal_care | category_shopping_net | category_shopping_pos | category_travel | amt_merchant_interaction | amt_day_interaction | amt_mean | amt_std | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 4.97 | Jennifer | Banks | F | 561 Perry Cove | Moravian Falls | NC | 3495 | Psychologist, counselling | 1325376018 | 36.011293 | -82.048315 | 0 | 1 | 1 | 32 | 514 | 0 | 0.0 | 1 | 0 | 0.056869 | 0 | False | False | False | False | False | False | False | True | False | False | False | False | False | 2554.58 | 4.97 | 87.393215 | 126.596221 |
| 1 | 107.23 | Stephanie | Gill | F | 43039 Riley Greens Suite 393 | Orient | WA | 149 | Special educational needs teacher | 1325376044 | 49.159047 | -118.186462 | 0 | 1 | 1 | 42 | 241 | 0 | 0.0 | 1 | 0 | 1.987606 | 0 | False | False | False | True | False | False | False | False | False | False | False | False | False | 25842.43 | 107.23 | 53.949320 | 118.337621 |
| 2 | 220.11 | Edward | Sanchez | M | 594 White Dale Suite 530 | Malad City | ID | 4154 | Nature conservation officer | 1325376051 | 43.150704 | -112.154481 | 0 | 1 | 1 | 58 | 390 | 0 | 0.0 | 1 | 0 | 3.341580 | 1 | False | False | False | False | False | False | False | False | False | False | False | False | False | 85842.90 | 220.11 | 65.870040 | 101.585754 |
| 3 | 45.00 | Jeremy | White | M | 9443 Cynthia Court Apt. 038 | Boulder | MT | 1939 | Patent attorney | 1325376076 | 47.034331 | -112.561071 | 0 | 1 | 1 | 53 | 360 | 0 | 0.0 | 1 | 0 | 0.618330 | 0 | False | True | False | False | False | False | False | False | False | False | False | False | False | 16200.00 | 45.00 | 72.776673 | 148.593473 |
| 4 | 41.96 | Tyler | Garcia | M | 408 Bradley Rest | Doe Hill | VA | 99 | Dance movement psychotherapist | 1325376186 | 38.674999 | -78.632459 | 0 | 1 | 1 | 34 | 297 | 0 | 0.0 | 1 | 0 | 0.440858 | 0 | False | False | False | False | False | False | False | False | True | False | False | False | False | 12462.12 | 41.96 | 95.178091 | 89.133972 |
| 5 | 94.63 | Jennifer | Conner | F | 4655 David Island | Dublin | PA | 2158 | Transport planner | 1325376248 | 40.653382 | -76.152667 | 0 | 1 | 1 | 59 | 607 | 0 | 0.0 | 1 | 0 | 1.446905 | 0 | False | True | False | False | False | False | False | False | False | False | False | False | False | 57440.41 | 94.63 | 65.401685 | 110.658809 |
| 6 | 44.54 | Kelsey | Richards | F | 889 Sarah Station Suite 624 | Holcomb | KS | 2691 | Arboriculturist | 1325376282 | 37.162705 | -100.153370 | 0 | 1 | 1 | 27 | 534 | 0 | 0.0 | 1 | 0 | 0.493300 | 0 | False | False | True | False | False | False | False | False | False | False | False | False | False | 23784.36 | 44.54 | 90.289835 | 129.427131 |
| 7 | 71.65 | Steven | Williams | M | 231 Flores Pass Suite 720 | Edinburg | VA | 6018 | Designer, multimedia | 1325376308 | 38.948089 | -78.540296 | 0 | 1 | 1 | 73 | 107 | 0 | 0.0 | 1 | 0 | 1.043926 | 0 | False | True | False | False | False | False | False | False | False | False | False | False | False | 7666.55 | 71.65 | 68.635163 | 113.994926 |
| 8 | 4.27 | Heather | Chase | F | 6888 Hicks Stream Suite 954 | Manor | PA | 1472 | Public affairs consultant | 1325376318 | 40.351813 | -79.958146 | 0 | 1 | 1 | 79 | 250 | 0 | 0.0 | 1 | 0 | 0.062252 | 0 | False | False | False | False | False | False | False | False | True | False | False | False | False | 1067.50 | 4.27 | 68.591883 | 117.228359 |
| 9 | 198.39 | Melissa | Aguilar | F | 21326 Taylor Squares Suite 708 | Clarksville | TN | 151785 | Pathologist | 1325376361 | 37.179198 | -87.485381 | 0 | 1 | 1 | 46 | 563 | 0 | 0.0 | 1 | 0 | 2.107354 | 1 | False | False | False | True | False | False | False | False | False | False | False | False | False | 111693.57 | 198.39 | 94.141775 | 133.033890 |
| amt | first | last | gender | street | city | state | city_pop | job | unix_time | merch_lat | merch_long | is_fraud | day | day_of_week | age | merchant_encoded | hour | time_diff | day_of_month | is_weekend | amt_ratio | high_value | category_food_dining | category_gas_transport | category_grocery_net | category_grocery_pos | category_health_fitness | category_home | category_kids_pets | category_misc_net | category_misc_pos | category_personal_care | category_shopping_net | category_shopping_pos | category_travel | amt_merchant_interaction | amt_day_interaction | amt_mean | amt_std | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1296665 | 72.17 | James | Hunt | M | 7369 Gabriel Tunnel | Pointe Aux Pins | MI | 95 | Electrical engineer | 1371816522 | 44.938461 | -83.996234 | 0 | 21 | 6 | 26 | 211 | 12 | 10659.0 | 21 | 1 | 0.762142 | 0 | False | False | False | False | False | True | False | False | False | False | False | False | False | 15227.87 | 433.02 | 94.693681 | 89.230833 |
| 1296666 | 7.30 | Amber | Lewis | F | 6296 John Keys Suite 858 | Pembroke Township | IL | 2135 | Psychotherapist, child | 1371816562 | 40.556811 | -88.092339 | 0 | 21 | 6 | 16 | 274 | 12 | 6401.0 | 21 | 1 | 0.101691 | 0 | False | False | False | False | True | False | False | False | False | False | False | False | False | 2000.20 | 43.80 | 71.786086 | 72.979539 |
| 1296667 | 19.71 | Christopher | Farrell | M | 97070 Anderson Land | Haines City | FL | 33804 | Exercise physiologist | 1371816656 | 27.465871 | -81.511804 | 0 | 21 | 6 | 29 | 221 | 12 | 13811.0 | 21 | 1 | 0.289941 | 0 | False | False | False | False | False | False | False | False | False | False | False | False | True | 4355.91 | 118.26 | 67.979325 | 210.676579 |
| 1296668 | 100.85 | Margaret | Curtis | F | 742 Oneill Shore | Florence | MS | 19685 | Fine artist | 1371816683 | 31.377697 | -90.528450 | 0 | 21 | 6 | 36 | 424 | 12 | 5451.0 | 21 | 1 | 1.130780 | 0 | False | False | False | False | False | False | True | False | False | False | False | False | False | 42760.40 | 605.10 | 89.186195 | 124.338770 |
| 1296669 | 37.38 | Marissa | Powell | F | 474 Allen Haven | North Loup | NE | 509 | Nurse, children's | 1371816696 | 41.728638 | -99.039660 | 0 | 21 | 6 | 40 | 598 | 12 | 72413.0 | 21 | 1 | 0.614222 | 0 | False | False | False | False | False | False | False | False | True | False | False | False | False | 22353.24 | 224.28 | 60.857433 | 151.348005 |
| 1296670 | 15.56 | Erik | Patterson | M | 162 Jessica Row Apt. 072 | Hatch | UT | 258 | Geoscientist | 1371816728 | 36.841266 | -111.690765 | 0 | 21 | 6 | 59 | 499 | 12 | 16781.0 | 21 | 1 | 0.246272 | 0 | False | False | False | False | False | False | False | False | False | False | False | False | False | 7764.44 | 93.36 | 63.182274 | 98.227403 |
| 1296671 | 51.70 | Jeffrey | White | M | 8617 Holmes Terrace Suite 651 | Tuscarora | MD | 100 | Production assistant, television | 1371816739 | 38.906881 | -78.246528 | 0 | 21 | 6 | 41 | 2 | 12 | 7962.0 | 21 | 1 | 0.511119 | 0 | True | False | False | False | False | False | False | False | False | False | False | False | False | 103.40 | 310.20 | 101.150621 | 115.992546 |
| 1296672 | 105.93 | Christopher | Castaneda | M | 1632 Cohen Drive Suite 639 | High Rolls Mountain Park | NM | 899 | Naval architect | 1371816752 | 33.619513 | -105.130529 | 0 | 21 | 6 | 53 | 599 | 12 | 29074.0 | 21 | 1 | 1.623797 | 0 | True | False | False | False | False | False | False | False | False | False | False | False | False | 63452.07 | 635.58 | 65.235995 | 131.805092 |
| 1296673 | 74.90 | Joseph | Murray | M | 42933 Ryan Underpass | Manderson | SD | 1126 | Volunteer coordinator | 1371816816 | 42.788940 | -103.241160 | 0 | 21 | 6 | 40 | 509 | 12 | 91018.0 | 21 | 1 | 0.782215 | 0 | True | False | False | False | False | False | False | False | False | False | False | False | False | 38124.10 | 449.40 | 95.753691 | 91.370450 |
| 1296674 | 4.30 | Jeffrey | Smith | M | 135 Joseph Mountains | Sula | MT | 218 | Therapist, horticultural | 1371816817 | 46.565983 | -114.186110 | 0 | 21 | 6 | 25 | 370 | 12 | 44250.0 | 21 | 1 | 0.062383 | 0 | True | False | False | False | False | False | False | False | False | False | False | False | False | 1591.00 | 25.80 | 68.929193 | 75.785422 |